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Abstract 

With the Dodgson rule, cloning the electorate can change the winner, which 
lYound l| 19771) considers an "absurdity". Removing this absurdity results in a new 
rule l|Fishburnj Il977l ) for which we can compute the winner in polynomial time 
(|Rothe et all l2003h . unlike the traditional Dodgson rule. We call this rule DC and 
introduce two new related rules (DR and D&). Dodgson did not explicitly propose 
the "Dodgson rule" (|Tidemanl . [l987l ); we argue that DC and DR are better realiza- 
tions of the principle behind the Dodgson rule than the traditional Dodgson rule. 
These rules, especially D&, are also effective approximations to the traditional Dodg- 
son's rule. We show that, unlike the rules we have considered previously, the DC, DR 
and D& scores differ from the Dodgson score by no more than a fixed amount given 
a fixed number of alternatives, and thus these new rules converge to Dodgson under 
any reasonable assumption on voter behaviour, including the Impartial Anonymous 
Culture assumption. 



1 Introduction 



Finding the Dodgson winner to an election can be very difficult, iBartholdi et al. (1989) 
proved that determining whether an alte rnative is the Dodgson winner is an NP-hard prob- 
lem. Later iHemasnaandra et al. (1997) refined this result by showing that the Dodgson 
winner problem was complete for parallel access to NP, and hence not in NP unless the 
polynomial hierarchy collapses. This result was of interest to computer science as the pre- 
viously known problems in this complexity class were obscure by comparison. 

For real world elections we do not want intractable problems. iTidemanl (|l987t ) proposed 
a simple rule to approximate the Dodgson rule. The impartial culture assumption states 
that all votes are independent and equally likely. Under this assumption, it has been proven 
that the probability that Tideman' s rule picks the Dodgson winn er converges to one as the 



number of voters goes to infinity ( M c Cabe-Dansted et al. . 20071) . Our paper also showed 



that this growth was not exponentially fast. Two rules have been independently proposed 
for which this convergence is expone ntially fast, our Dodgson Quick (DQ) rule and the 
GreedyWinner algorithm proposed bv lHoman and Hemasnaandral (2005). It is usually easy 
to verify that t he DQ winner and GreedyW inner are the same as the Dodgson winner, 
a property that iHoman and Hemaspaandral formalise as being "frequently self-knowingly 
correct" . However the proofs of convergence depended heavily on the unrealistic impartial 
culture assumption. We shall show that under the Impartial Anonymous Culture (IAC) 
these rules do not converge. 

The importance of ensuring that statistical results hold on re asonable assumptions on 
voter behaviour is considered bv lProcaccia and Rosenschein (|2007t) . They define "determin- 
istic heuristic polynomial time algorithm" in terms of both the problem to be solved and a 
probability distribution over inputs. However they do not consider the issue of whether a 
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heuristic is self- knowingly correct. Thus we extend the concept of an algorithm being "fre- 
quently self- knowingly corr e ct" to allow particular probability distributions to be specified. 
Procaccia and Rosenscheinl (|2007l ) also propose "Junta" distributions; these distributions 



arc intended to produce problems that are harder than would be produced under reasonable 
assumptions on voter behaviour. Hence if it is easy to solve a problem when input is gen- 
erated according to a Junta distribution it is safe to assume that it will be easy under any 
reasonable assumption of voter behaviour. We will not use Junta distributions, but instead 
simply use that fact that even "ncck-and-neck" national elections are won by thousands of 
votes. We will also show that the rules we have considered previously, Tideman, Dodgson 
Quick etc., do not converge to Dodgson's rule under the IAC. 

However, the reason that the Dodgson rule is so hard to compute is because cloning 
the electorate can change the winner. That is, if we replace each vote with two (or more) 
identical votes, this may change the winner. When discussing majority voting I Yound (|l977l ) 
described this property as an "absurdity" . Young suggested tha t such absurditie s be fixed 
in majority voting by allowing fractions of a vote to be deleted. iFishburnl (|1977l ) proposed 
a similar modification to the traditional Dodgson rule, which we call Dodgson Clone. The 
Dodgson Clone scor es can be computed by relaxing the integer constraints on the Integer 
Linear Program that lBartholdi et al.1 (jl989l) proposed to calculate the Dodgson score; normal 
(rational) Linear Programs can be solved in polyno mial time, we can compute the Dodgson 



Clone score in polynomial time ([Rothe et all 120031 ). 



The Dodgson Clone rule is also an effective approximation to the Dodgson rule. In 
computer science, an approximation typically refers to an algorithm that selects a value 
that is always accurate to within some error. This form of approximation is not meaningful 
when selecting a winner, although these rules can be u sed to appro ximate the frequency 
that Dodgson winner has some property. For example, IShahl (|2003f ) used Tideman's rule 
to approximate the frequency that the Dodgson winner matched the winners according to 
other rules, and so such Tideman, DQ etc. can be considered approximations of Dodgson's 
rule in a loose sense. However we can approximate the Dodgson score. We will show that for 
a fixed number of alternatives, the Dodgson Clone score approximates the Dodgson score to 
within a constant error. As it is implausible that the margin by which the winner wins the 
election will not grow with the size of the electorate, the Dodgson winner will converge to the 
Dodgson Clone winner under any reasonable assumption of voter behaviour. In particular 
we will show that they will converge under the Impartial Anonymous Culture assumption. 

We propose two closer approximations Dodgson Relaxed (DR) and Dodgson Relaxed and 
Rounded (D&). These approximations, like the traditional Dodgson rule, are not resistant 
to cloning the electorate. This allows them to be closer to the Dodgson rule than Dodgson 
Clone, both rules converge to the Dodgson rule exponentially quickly under the Impartial 
Culture assumption. The DR rule is superior to the Dodgson rule in the sense that it can 
split ties in favour of alternatives that are fractionally better. The D& scores are rounded 
up, so the D& rule does not have this advantage. However it is exceptionally close to 
Dodgson. In 43 million elections randomly generated according to various assumptions on 
voter behaviour, the D& winner diff ered from the Dodgson winner in only one election. 

The approximation proposed by IProcaccia et al. ( 2007 ) is similar to these approxima- 
tions in the sense that it involves a relaxation of the integer constraints. However their 
approximation is randomised, and thus quite different from our deterministic approxima- 
tions. Using a randomised approximation as a voting rule would be unusual, and they do 
not discuss the merits of such a rule. Thus the focus of their paper is quite different, as 
we present rules that we argue are superior to the traditional formalisation of the Dodgson 
rule. Additionally, they do not discuss the issue of frequently self-knowing correctness. 

Another approach to computing the Dodgson score has been to limit some parameter. 



Bartholdi et al.l (|1989» showed that computing the Dodgson scores and winner is polynomial 



when either the number of voters or alternatives is limited. It was shown that computing 
these from a voting situation i s logarithmic with respe ct to the number of voters when the 
number of alternatives is fixed (|M c Cabe-Danstedl . 120061) . and hence Dodgson winner is Fixed 
Parameter Tractable (FPT) with number of alternatives as the fixed parameter. It is now 
also known that the Dodgson w inner is FPT when the Dodgson score is taken as the fixed 
parameter ( Betzleri et al.l . [2008h . 

Thus we will define the Dodgson based rules in terms of Condorcet-tie winners, rather 
than Condorcet winners. As we will discuss briefly, this does not affect convergence. 



2 Preliminaries 

In our results we use the term agent in place of voter and alternative in place of candidate, 
as not all elections are humans voting other humans into office. For example, in direct 
democracy, the citizens vote for laws rather than candidates. 

We assume that agents' preferences are transitive, i.e. if they prefer a to b and prefer b 
to c they also prefer a to c. We also assume that agents' preferences are strict, if a and b are 
distinct they either prefer a to b or b to a. Thus we may consider each agent's preferences 
to be a ranking of each alternative from best to worst. 

Let A and J\f be two finite sets of cardinality m and n respectively. The elements of A 
will be called alternatives, the elements of N agents. We represent a vote by a linear order 
of the m alternatives. We define a profile to be an array of n votes, one for each agent . Let 
V = (Pi, P2, ■ ■ ■ , P n ) be our profile. If a linear order Pi e C(A) represents the preferences 
of the i th agent, then by aPib, where a, b £ A, we denote that this agent prefers a to b. 

A multi-set of linear orders of A is called a voting situation. A voting situation specifies 
which linear orders were submitted and how many times they were submitted but not who 
submitted them. A voting situation is sometimes referred to as a "succinct" election, as it 
can be represented succinctly when there only a few alternatives ( Faliszewski et al. . 20061) . 



The Impartial Culture (IC) assumption is that each profile is equally likely. The Impar- 
tial Anonymous Culture (IAC) assumption is that each voting situation is equally likely. To 
understand the difference, consider a two alternative election with billions of agents; under 
IC it is almost certain that each alternative will get 50% (±0.5%) of the vote; under IAC, 
50.0% is no more likely than any other value. 

Let V = (Pi, P2, ■ ■ ■ , P n ) be our profile. We define n xy to be the number of linear orders 
in V that rank x above y, i.e. n xy = #{i | xPiy}. 

Definition 2.1. The advantage of a over b is defined as follows: 

adv(a, b) = max(0, n a i, — nt> a ) 

A Condorcet winner is an alternative a for which adv(a, b) > for all other alternatives 
b. We define a Condorcet-tie winner, to be an alternative a such adv(b,a) = for all 
other alternatives a. A Condorcet winner or Condorcet-tie winner does not always exist. 

It is traditional to define the Dodgson score of an alternative as the terms of the minimum 
number of swaps of neighbouring alternatives required to make that alternative defeat all 
others in pairwise elections, i.e. make the alternative a Condorcet winner. When not 
requiring solutions to be integer this becomes undefined, as if we defeat an alternative by 
e > then there exists a better solution where we defeat the alternative by only e /2. 

For this reason, when defining the Dodgson scores we only require that the alternative 
defeat or tie other alternatives, i.e. make the alternative a Condorcet-tie winner. For better 
consistency with the more traditional Dodgson rule we could define the Condorcet winner 



as an alternative a for which adv(a, b) > 1. However this would mean that the Dodgson 
Clone rule would not be resistant to cloning of the electorate. 

This difference in definition does not affect convergence. Our proof of convergence re- 
lies only on fact that Dodgson, D&, DR and DC scores differ by at most a fixed amount 
(C(m!)) when the number of alternatives is fixed. To convert a Condorcet-tie winner c into 
a Condorcet winner c we need to swap c over at most (m — 1) alternatives, each requiring 
at most (m — 1) swaps of neighbouring alternatives. Hence the difference between the score 
according to these different definitions of Dodgson is at most (m — l) 2 . 

We will now define a number of rules in terms of scores. The winner of each rule below 
is the alternative with the lowest score. 

The Dodgson score ( Dodgsor] 18761 . see e.g. Blackl 1958 : Tideman 1987 ). which we 
denote as Scd(o), of an alternative a is defined as the minimum number of swaps of neigh- 
boring alternatives required to make a a Condorcet-tie winn er. We call the altern ative(s) 



19891) 



with the lowest Dodgson score(s) the Dodgson winner(s). (jBartholdi et al 
The Tideman score Sct(o) of an alternative a is: 

Sccj(a) = J]] adv(£>, a). 

b^a 

The Dodgson Quick (DQ) score Scq(g), of an alternative a is 

Sc Q (a) = F ( b , a ): whcr c F(b, a) = adv ( b ' a ) 

b^a 

Although the definitions of Scq and Sct are very similar, the Dodgson Quick rule con- 
verges exponentially fast to Dodgson's rule under the Impa rtial Culture assumption, where 



as Tideman's rule docs not (|M c Cabe-Dansted et al.l . l2007f ). This is because Dodgson and 



DQ are more sensitive to a large number of alternatives defeating a by a small odd margin 
(e.g. 1) than Tideman is. 

We define the fc-Dodgson score Sc^, (d) of a as being the Dodgson score of a in a profile 
where each agent has been replaced with k clones, divided by k. That is, where V is our 
fixed profile, V k is the profile with each agent replaced with k clones, and Sc-£,[P](a) is the 
Dodgson score of a in the profile V, then 



S Crj (a) =Sc^[V](a) 



Scn[P k )(a) 



We define the Dodgson Clone (DC) score Scc(a) of an alternative a as min^ Sc£)(a). 
The DC score can be equivalently defined by modifying the Dodgson rule to allow votes to 
be split into rational fractions and allowing swaps to be made on those fractions of a vote. 
Note that like the Tideman approximation, the DC score is less sensitive than Dodgson to 
a large number of alternatives defeating a by a margin of 1, so the DC score is unlikely to 
converge to Dodgson as quickly as DQ under the Impartial Culture assumption. 

We define the Dodgson Relaxed (DR) score Scr as with the Dodgson score, but 
allow votes to be split into rational fractions. However we require that a be swapped over b 
at least F(b, a) times. Thus ScR,(<i) > Scq (d) and the Dodgson Relaxed rule will converge at 
least as quickly as DQ. The DR rules thus sacrifices independence to cloning of the electorate 
to be closer to the Dodgson rule than DC. 

The Dodgson Relaxed and Rounded (D&) score Sc& is the DR score rounded up, 
i.e. Sc & (d) = rSc R (d)l. 



3 Dodgson Linear Programmes 



iBartholdi et al.l (|l989h defined an ILP for determining the Dodgson score of a candidate as 
follows: 

niin V ( ./.r, , subject to 
J2j x ij = Ni (for each type of vote i) 
e ijkXij > dk (for each alternative k) 

> 0, and each must be integer, where the variables and the candidate 

d is swapped up j positions i times in votes of type i. The constant iVj represents the 
number of votes of type i. The constant eijk is 1 if moving d up j positions in vote type i 
raises d above fc, and otherwise. The dk is the "minimum number of votes" that d must 
gain to defeat k. This definition opens a potential ambiguity: is dk = adv(fc, d)/2 or is 
dk = F(k,d) = \a,dv(k, d)/2] . This ambiguity does not affect the ILP as both definitions 
are equivalent when we require that the variables be integer. However when we relax the 
requirement that the varia bles be integer, we get the DC score if dk = &dv(k,d)/2 and the 
DR score if d k = F(k,d). IBartholdi etafl (|l989l) note that there are only to! orderings of 



the alternat ives and thus no more th an ml vote types. However, since we never swap d down 
the profile ( M c Cabe-Danstedl . 20061 ). the ordering of the candidates below d are irrelevant. 



We will formalise this notion as d-equi valence: 

Definition 3.1. Where v is a linear order on to alternatives, let represent the i th highest 
ranked alternative and v<i represent the sequence of i highest ranked alternatives. Where 
d is an alternative, we say v and w are d-equivalent (v wj iff there exists i such that 
v t = d and v< ; = w< ; . 

Lemma 3.2. Let Sd be the set of d- equivalence classes. Then \Sd\ is less than (to — l)!e 
where e = 2.71 ... is the exponential constant. 

Proof. We see that there is one equivalence class where d is ranked in the top position, 
to — 1 equivalence classes where d is ranked in the second highest position, and in general 
rifc^i-i+i & when d is ranked i th from the top. We note that: 

(m-1)! 



n k 



(m — i)\ 

k=m—i+l v ' 

We see that 

(m-1)! (m-1)! (to. -1)! 



\S d \ 



(m — m)! (to — (m — 1))! (m — 1)! 

\t ( 1 1 1 

:(m-l)! — + — H h 



0! 1! (m-1)! 
<(m-l)! rI + I + ...J =(m-l)!e □ 

Corollary 3.3. // we categorise votes into type based on d-equivalence classes (instead of 
linear orders), the ILP below has less than to (to — l)!e = m\e variables. 

Note that there are less than (to — l)!e choices for i, no more than m choices for j and 
thus less than m (to — l)!e = m\e variables (each of the form yij). 

Lemma 3.4. We can transform the ILP into the following form \M C Cabe-DansteA . 200 A ): 

m ^ n J2iJ2i>oyi3 subject to 

i/ia = Ni (for each type of vote i ) 



Y,ij{ e ijk ~ e-i{j-i)k)yij > D k (for each alternative k) 
Uij < Vi(j-i) (for each i and j > 0) 
Uij > 0, and each j/y must be integer. 

Proof. For each i and j variable j/y represents the number of times that the candidate d is 
swapped up at least j positions. In the LP Dk may be defined as adv(fc, d)/2 to compute the 
DC score or [~adv(fc, d)/2] to compute the DR score (under the LLP these are equivalent). 
Thus, 

x ij — Vi{j—"i) arL0 - Vij ~ Sfe>j x ik- Using these two equalities it is easy to derive the 
above ILP. □ 

Theorem 3.5. The DR (Sen), DC (Sec) and D& (Sc&) scores are bounded as follows: 

Sc D (d) - (m - l)!(m - l)e < Sc c (d) < Sc R (d) < Sc& < Sc (d) 

Proof. Every solution to the Integer Linear Program for the Dodgson score is a solution 
to the Linear Program for the DR score. Every solution to the LP for the DR score is a 
solution to the LP for the DC score. Thus the DC score cannot be greater than the DR 
score, which cannot be greater than the Dodgson score (Scc(d) < Scr(c?) < Scn(d)). Since 
Scd is integer, it follows that Scr(c£) < |"Scr(c£)] = Sc&(d) < Scu(d). Also note that given 
a solution y to either LP, we can produce a solution y' to the ILP simply by rounding up 
each variable (y'^ = [j/y]), hence 

Sc D (d) - Scc(d) < E E rWil - Vij < E E 1 ^ ijrt - ly.e 

i j>0 i j>0 

Since i can take less than (m — l)!e values, and j can vary from 1 to (m — 1), it follows 
that 

Sc D (d) - (m - l)!(m - l)e < Sc c (d) < Sc R {d) < Sc & < Sc D (d) □ 

We can transform the (I)LP into a form wi th less than M = (m — l)!e va riables an d 
encoded in L E O [(m - 1)! In ((m - l)!n)] bits (|M c Cabe-Danstedl . l2006h . Using iGonzaeal 's 



algorithm we may solve LPs in 0(M 3 L) arithmetic operations of 0(L) bits of precision. 
From these two facts we get the following corollary: 

Corollary 3.6. We can compute the DC. DR and D& scores with 



operations of 



bits of precision. 



O ((m- l)!) 4 ln((m- l)!n) 



0[(m-l)!ln((m-l)!n)] 



This suggests that computing these scores for a 100 million agent, 5 alternative profile 
would be a trivial task for a modern computer. Hence these rules could be used as a real 
world approximations to the Dodgson rule. 

We also note that there cannot be more vote types than votes, hence there are at most 
mn variables, and the LP can be encoded in 0(mn) bits. Hence we can compute these 
scores in C?(m 4 n 4 ln(mn)) arithmetic operations requiring 0(mn\n(mn)) bits of precision. 
Hence computing the DC, DR and D& scores requires only polynomial time. 



3.1 Empirical Results. 



In practice the major difference between DR and Dodgson i s that the PR rule picks a smaller 
set of tied winners. We performed 43 million simulations ( M c Cabe-Danstedl . 12006 ) with up 
to 25 alternatives and 85 candidates and assuming various amounts of homogeneity, modeled 
using the PE distribution, in the population. We found only 25 cases where DR picked a 
different set of tied winners to Dodgson, and only two cases where the set of tied DR winners 
was not a subset of the Dodgson winners. As the DR scores are fractions, the DR rule can 
split ties according to which are fractionally better, which may be considered to be more 
democratic than e.g. splitting ties according to the preferences of the first agent. In this 
sense DR is superior to the Dodgson rule. By rounding the DR scores up, we get a new rule 
that approximates Dodgson yet more closely, picking the Dodgson winner in all 43 million 
simulations except for a single profile with 25 alternatives and 5 agents generated under 
the Impartial Culture assumption. 10,000 such profiles were generated, and by comparison, 
both the Tideman and DQ approximations differed from Dodgson in 13% of these profiles. 

The similarity between DR and Dodgson suggests a reason why we were able to compute 
the Dodgson winners for non-trivial populations in milliseconds. LPs tend to be easy to 
solve, the NP-hardness of ILPs comes from the difficulty in finding integer solutions. Since 
the LP solutions tend to be very close to the integer solutions, finding the integer solution 
from the LP solution is often trivial. 



4 Counting Proof of Convergence under IAC 

For a voting situation U and linear order v, we represent the number of linear orders of 
type v in U by #t/(v). 

Where X € {D,T}, let Ax (a,z) be equivalent to Scx(a) — Scx(z)- Given an arbitrary 
pair of alternatives (a, z) we pick an arbitrary linear order ab . . . z with a ranked first and z 
ranked last and call it v. We also define the reverse linear order v = z . . . ba. 

Lemma 4.1. Replacing a vote of type v with a vote of type v will increase At (a, z) by at 
least one. 

Proof. We see that replacing a vote of type v with a vote of type v will increase adv(a, z) 
by one, or decrease adv(z, a) by one. □ 

Lemma 4.2. Replacing a vote of type v with a vote of type v will increase Ad(o,z) by at 
least one. 

Proof. Say a is not a Condorcet-tie winner, but is a Condorcet-tie winner after some minimal 
set S of swaps is applied to the profile P. Let P' be the profile P after one vote of type 
v has been replaced with a vote of type v. If any swaps were applied to the vote those 
swaps are no longer required, and Scd[-P'](o) < Sco[-P](a). Otherwise we can apply the set 
of swaps S to P' resulting in adv(a, k) being at least 2 for all other alternatives k. Hence 
we can remove one of the swaps, and still result in a being a Condorcet-tie winner after the 
swaps have been applied to P' . 

Say a is a Condorcet-tie winner in P. Then z is not a Condorcet-tie winner in P' . As 
in the previous paragraph we can conclude that Scjj[P](z) < Scr>[P'](z). □ 

Lemma 4.3. For a fixed integer k, and a fixed ordered pair of alternatives (a, z) the pro- 
portion of voting situations, with n agents and m alternatives, for which Ax (a, z) = k is no 
more than: 

(m! - 2) 
(n + ml - 2) 



Proof. Say v = ab . . . z is some fixed linear order and v = z . . . ba is the reverse order. We 
define an equivalence relation ~ on the set of voting situations, as follows: say U, V are two 
voting situations, then 

U ~ V <==^ V w ^ v . w ^v#y(w) = #[/(w). 

From Lemma 14.11 and 14.21 we see that in each equivalence class, there can be at most one 
voting situation for which Ax (a, z) = k . Also note that whereas there are 



\S n (A)\ = 



n + ml — 1 

n 



distinct voting situations there are at most 

'n + (ml - 1) - 1 



equivalence classes under ~. Hence the proportion of voting situations for which Ax(a, z) = 
k is no more than: 

(n + m!-l)! n!(m!-2)! _ (n + ml - 1)1 (ml - 2)1 _ (ml - 2) 
n!(m!-l)! (n + m!-2)! ~ (n + ml - 2)1 (ml - 1)! _ (n + m! - 2) 

Lemma 4.4. J/ At(o, z) = A: anrf a is a Tideman winner and z is a DQ winner, then 

< k < 771. 

Proof. As a is a Tideman winner and z is a DQ winner, then 

Sc T (a) < Sc T (2), Sc Q (z) < Sc Q (a) 

Recall that 

"adv(y,x)" 



Sc Q (x) = X: , Sc T (x) = ^adv( y 



We see that adv(y,x) < 2 \adv(y, x)/2~\ < a,dv(y,x) + 1 and so Scx(a;) < 2Scq(x) < 
Scx(a;) + 777 for all alternatives x. Thus 

Sc T (z) < 2Sc Q (z) < 2Sc Q (a) < Sc T (a) + m. 

And so Scx(a) < Scx(z) < Scx(a) +m. Let k = Scx(a) — Scx(z). Then < k < m, and so 
there are no more than m ways of choosing k if we wish the DQ and Tideman winners to 
differ. □ 



Recall that Theorem 13 . 5 1 states : 

Sc D (d) - (m - l)!(m - l)e < Sc c (d) < Sc R (d) < Sc& < Sc D (d) 

where e = 2.71 ... is the exponential constant. Given that there are only 777 ways of choosing 
k such that the Tideman and DQ winners differ, and less than (to — 1)! (tti — l)e ways of 
choosing k such that the Dodgson, DC, DR and/or D& winners differ, we get the following 
theorem. 



Theorem 4.5. The proportion of voting situations, with n agents and m alternatives, for 
which a is a Tideman winner and z is a DQ winner is no more than: 

(ml - 2) 

-m. 



(n + ml - 2) 



The proportion for which a is Dodgson winner and z is a DC, DR and/or D& winner is 
less than 

(m! -2) . 
, - , ' m-1 ! m-1 e. 
(n + ml — 2) 

Theorem 4.6. The proportion of voting situations, with n agents and m alternatives, for 
which the Tideman winner differs from the DQ winner is no more than: 



(n + ml — 2) 



(m — l)m 



The proportion where the Dodgson winner differs from the DC, DR, and/or D& winners is 
less than 



2) 



m!(m — 1) e. 



Proof. Obvious, as there are only m(m — 1) ways of choosing a and z from the set of 
alternatives. □ 

Corollary 4.7. The probability that the DQ and Tideman rule pick the same winners 
converges to I as n — > oo, under the Impartial Anonymous Culture assumption. Likewise 
the probability that the DR, DC, D& and Dodgson rules pick the same winner converges to 
1 as n — >• oo. 

In other words t he DR, DC and Dfc winners (and s cores) provide "deterministic heuristic 
polynomial time" ( Procaccia and Rosenscheirl 2007 ) algorithms for the Dodgson winner 
(and score) with the IAC dist ribution. We can use the same tec hnique to show that the 
Greedy Algorithm proposed bv lHoman and Hemaspaandra ( 2005 ) converges to the DQ and 
Tideman rules under IAC, as the GreedyScore differs from the DQ score by less than m.By 
setting k to we may likewise prove that the probability of a non-unique Tideman or 
Dodgson winner converges to under IAC. 

We extend the conce pt of a "frequently self-knowingly correct algorithm" 

(jHoman and Hemaspaandra . 2005) such that we can specify a distribution over which the 
algorithm is frequently self-knowingly correct. 

Definition 4.8. A self-knowingly correct \Homan and Hemasvaandrd . 200 A ) algorithm A 
is a ''frequently self-knowingly correct algorithm over a distribution fi" for g: E* — > T iff 

lim V P M (x) = 

.tGS", A(x)£T X {maybe} 

where Pfi{x) is the probability that X = x when X is chosen from E™ under the \x distribution 
and for all x we have (A(x))i = g(x) or (A(x))2 — "maybe". 

Using the set of linear orders C(A) as E we may construct a frequently self-knowingly 
correct algorithm from DC (or DR, or D&) as the algorithm can output "definitely" when- 
ever the DC winner has DC score that is at least (m — l)!(m — l)e less than any other 
alternative. 



5 Non-convergence of Tideman Based rules 



Under the Polya-Eggenbcrger (PE) urn model we start with a non-negative integer a and 
an urn containing balls, each of a different colour. To generate each random sample, we pull 
a ball out of the Urn at random and note its colour. After removing each ball, we return 
the ball that was taken to the urn together with a additional balls of the same colour to the 
urn. Our sample is the ball taken from the urn. 

If a = 0, PE generates a binomial or multinomial distribution and these distributions 
converge to the standard distribution as the number of samples increase to infinity. As the 
number of samples increase to infinity, it is almost certain that the ratio of the balls drawn 
will match the ratio of the balls in the urn. 

If a > the ratio of balls in the urn varies. We see that for any x € (0, 1) and e > 
there is a finite set of draws which will result in the ratio of balls of in the urn ratio being 
within the range (x — e,x + e). Because of this, the samples drawn from the urn may 
converge on any ratio as the number of balls drawn approaches infinity. More formally, the 
PE dist ribution conv erges to the Beta distribution as the number of samples approaches 



infinity (jFellerl . 119711) , thus the limiting probability of the ratio falling within any open set 
is non-zero. 

For generation of random profiles, we replace colours with linear orders. The parameter a 
characterises homogeneity; for a = we obtain the well known Impartial Culture assum ption 



and for a = 1 the Impartial Anonymous Culture assumption (jBerg and Lepellvl . Il994f ). 
Definition 5.1. A "voting ratio" is a junction f : C(A) — > [0, 1] such that 

E /(v) = i- 

veC{A) 

We say that a profile V reduces to a voting ratio f if 

Note 5.2. A voting ratio is similar to a voting situation, but unlike a voting situation does 
not contain any information about the total number of agents. 

Definition 5.3. We say that a voting ratio f is "bad" if for every profile V that reduces to 
f and has an even number of agents, the DQ winner ofV differs from the Dodgson winner. 



Example 5.4. The following voting ratio is bad 

<?(v) = 



7 /is if v = abode 

6 /i8 if v = cdabe 

5 /i8 if v = bcead 

otherwise 



For any profile with I8n agents that reduces to the above voting ratio, the DQ and Dodgson 
score of c will be 3n; the DQ score of a will be 2n and the Dodgson score of a will be 4n. 
Hence a will be the DQ winner but c will be the Dodgson winner. 

Example 5.5. The following voting ratio is bad. 

Mv) = < 



16 /39 if v = obex 
12 /39 if v = cxab 



10 /39 if v = bcxa 
Y39 if v = cbax 
otherwise 



We see that for a profile that reduces to the above voting ratio, we have the following advan- 
tages and scores per 78 agents: 



1,1,1,1 



.mine 
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a 


b 


c 


X 


DQ score 


12 


17 


13 


54 


Dodgson score 


14 


17 


13 


54 
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The Dodgson score of a is higher than the DQ score of a as to swap a over c we must 

first swap a over b or a over x. We have to swap a over c at least 7 times. Hence we must 

use a total of at least 14 swaps to make a a Condorcet-tie winner. We see that a is the DQ 

winner, but c is the Dodgson winner. 

We will now show that there exists a neighbourhood around the voting ratios g and h 

that is bad. 

Lemma 5.6. Altering a single vote will change the Dodgson scores and DQ scores by at 
most m — 1 . 



Proof. Recall that 



Scq(o) 



F(b,a), where F(b, a) 



adv(6, a) 



We see that for each other alternative b changing a single vote can change F(b, a) by at 
most one, and there are m — 1 such alternatives. 

Say that P and R are two profiles that differ only in a single vote. Let P' and R' be 
P and R respectively after some arbitrary d has been swapped to the top of the vote that 
differs, which requires no more than m — 1 swaps. We see that 



Sc D [P](rf) < Sc D [i"](d) 
Sc D [J?](d) < Sc D [R'}(d) 



m 
m 



1 < SC D [#](<*) 

1 < Sc D [P](d) 



m 
m 



□ 



Corollary 5.7. For any positive integer k, alternative d, profile P and rule X 6 {D,Q} 
(i.e. Dodgson or DQ), if Sc^(d) < 5cx(a) — 2k(m — 1) for all other alternatives a, then d 
will remain the unique X winner in any profile that results from changing k or less votes. 

Lemma 5.8. There is a neighbourhood of bad voting ratios around the voting ratio h from 
Example 15.51 

Proof. We see that in a profile with n agents that reduces to the voting ratio h the Dodgson 
and DQ winners have scores that are at least lower than the other alternatives. Thus 
if we alter less than ^fg 2(i)(4-i) = its v °t es i t ne DQ winner will remain different from the 
Dodgson winner. □ 

Theorem 5.9. If we generate profiles randomly according to the PE distribution, with a > 
and abcx, cxab, bcxa, cbad existing in the urn, the probability that the DQ winner is not 
the Dodgson winner will converge to some non-zero value as the number of agents tends to 
infinity. 



Proof. The PE distribution converges to the Beta distribution. Given any neighbourhood, 
the probability that a variable distributed according to a non-trivial beta distribution will 
fall within that neighbourhood is non-zero. There is a neighbourhood of bad voting ratios 
and any profile that reduces to a bad voting ratio has a DQ winner that is different to the 
Dodgson winner. □ 

We can generalise this result by adding alternatives that lose to all of a, b, c and x. As 
the Impartial Anonymous Culture assumption is a special case of the PE distribution with 
a = I and the urn starting with an equal number of all linear orders, these approximations 
also do not converge under the IAC assumption. Thus we may now state the following 
theorem: 

Theorem 5.10. If we generate profiles randomly according to the IAC distribution, with 
m > 4, the DQ, Greedy and Tideman winners do not converge to the Dodgson winner as 
the number of agents tends to infinity. 



Under the IAC , the Tideman rule (|M c Cabe-Dansted et all 120071 ) and the greedy algo- 
rithm proposed by Homan and Hemaspaandra ( 2005h converges to the DQ rule, which does 
not converge to Dodgson's rule. It follows that the Tideman rule and greedy algorithm do 
not converge to Dodgson's rule under the IAC. 

As the difference betwe en the DQ scores and the GreedyScores 
(jHoman and Hemaspaandra! . 120051) is less than m, the DQ winner and GreedyWinner ijjjjjj 
.mine will converge under the IAC. ======= will converge under the IAC. IHIIH .r!76 



6 Conclusion 

We have previously (lM c Cabe-Dansted et all 120071) proved that Tideman's rule does con- 
verge under IC, and presented a refined rule (DQ) which converged exponentially quickly. 
Another approximation with expo nentially fast convergence was independently proposed by 
Homan and Hemaspaandra (|2005h . 

Unfortunately, real voters are not independent, and so the Impartial Culture assumption 
is not realistic. This paper investigates the asymptotic behaviour of approximations to 
the Dodgson rule under other assumptions of voting behaviour, particularly the Impartial 
Anonymous Culture (IAC) assumption, that each multiset of votes is equally likely. 

We found that the approximations converge to each other under IAC, but they do not 
converge to Dodgson. Hence the DQ rule is not asymptotically closer to the Dodgson rule 
than the Tideman rule is, although DQ converges faster under the IC. 

It is not realistic to assume that voters' behaviour will precisely follow any mathematical 
model, including IAC. Fortunately the proof that our new approximations converge to 
Dodgson does not depend on the details of the IAC. Indeed, the DC, DR, D& and Dodgson 
winners will all be the same if the scores of the two leading alternatives differ by less than 
(m — l)!(m — l)e. This assumption is realistic for a sufficiently large number of voters. Even 
in the neck and neck 2000 US presidential elections, the popular vote for the leading two 
alternatives differed by half a percent — over half a million votes. If we used the Dodgson 
rule to choose between the top five alternatives, the difference between the Dodgson scores 
of the leading two alternatives would have to be less than 2G1 for the winners to differ. Even 
if the entire electorate conspired to cause the winners to differ, minor inaccuracies such as 
hanging chads could frustrate their attempt. Even in cases where the Dodgson Relaxed docs 
differ from the Dodgson, the difference seems to be primarily that the Dodgson Relaxed rule 
picks a smaller set of tied winners as it is able to split ties in favour of alternatives that 
have fractionally better DR scores. This is in some sense more democratic than tie breaking 
procedures such as breaking ties in favour of the preferences of the first voter. 



Thus determining that an a lgorit hm is "frequently self-knowingly correct" , as denned 
by iHoman and Hemaspaand"ra ( 2005t ). is insufficient to conclude that the algorithm will 



converge in practice. The DQ and GreedyWinner provide frequently self-knowingly correct 
algorithms, but do not converge under other assumptions of voter behaviour such as IAC. We 
have extended the definition to allow a distribution to be specified. So, in other words, DQ 
and GreedyWinner provide algorithms that are "frequently self-knowingly correct over IC" , 
but unlike DC, DR, and D&, do not provide algorithms that are "frequently self-knowingly 
correct algorithms over IAC" . 

We have previously shown that the Dodgson scores and winners of a voting situa- 
tion can be computed from a v oting situation with Q( fi (to) In n) arithmetic operations 
of 0(/2(m) In n) bits of precision ( M c Cabe-Danstedl . 2006t ) for some pair of functions f± and 



f2- However we did not find a good upper bound on /i or /2, even /i(4) may be unrea- 
sonably large. In contrast, we found that we may compute the DC, DR and D& score with 



(9 



((to - l)!) 4 ln((m- l)!n) 



operations of 
0[(m- l)!ln((m- l)!n)] 



bits of precision. Although ((m — l)!) 4 is quite large, a modern computer can perform 
billions of elementary operations a second. This result suggests that even the worst case 
time for billions of voters and 6 or 7 a lternatives is feasible. Also, whereas DodgsonScore 
is NP-complete dBartholdi et al. . 19891) . we can also compute the DR and D& scores using 



C(m 4 n 4 ln(mn)) arithmetic operations requiring C(mnln(mn)) bits of precision. 

We have found that the scores of the most of the approximations we have studied form 
a hierarchy of increasingly tight lower bounds on the Dodgson score: 

Sc«(a;) Sct(^) „ „ „ „ „ / ..% 
< — y 1 < Sc Q < Sc R < Sc& < Sc D < Sc R + (to - 1)!(to - l)e 

The Dodgson Clone rule does not fit in that hierarchy, although it is the case that 
^SlM. < Sc c < Scr < Sc D < Sc c + (to - 1)!(to - l)e . 

Despite the great accuracy of the D& approximation, there are good reasons to pick other 
approximations. The difference between the DR rule and the D& rule is that the DR rule can 
split ties based on fractional scores, so the DR rule may be considered superior to D& and 
Dodgson's rule. The DC rule is resistant to cloning of the electorate. The DQ rule is very 
simple to compute, and very easy to write in any programming language. As the DQ rule is 
known to converge exponentially fast under IC, this makes the DQ rule very appropriate for 
cases where the data is known to distributed according to IC. This is the case for studies that 



have randomly generated data according to the IC (see e.g. lM c Cabe-Dansted and Slinko 



200d IShahl . l20oi iNurmil . Il983f ). The Tideman rule is no more easy to compute than the 



DQ rule. However, the mathematical definition of the Tideman rule is simpler than the 
DQ rule. This makes the Tideman rule useful for theoretical studies of the Dodgson rule 
where the speed of convergence is not important. Also, like the DC rule, the Tideman rule 
is resistant to cloning the electorate. 

We conclude that the DC and DR rules are superior, for social choice, to the traditional 
definition of the Dodgson rule. DC provies resistance to cloning the electorate, and both 
are better at splitting tics than the traditional definition of Dodgson rule. We know of 
no advantage of the traditional definition over these rules. Even if the traditional Dodgson 
winner is preferred, it may be hard to justify the computational complexity of the traditional 
Dodgson rule, especially since it is almost certain that these rules would pick the same result. 
If, despite all this, the traditional Dodgson rule is still chosen, these new rules provide 



frequently self-knowingly correct algorithms for the Dodgson winner for any reasonable 
assumption of voter behaviour, including IAC. 
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